Provenance in Dynamic Data Systems

نویسندگان

  • Jing Zhang
  • H. V. Jagadish
چکیده

Most digital data sets are subject to modifications. For example, scientific data may be updated according to the new experimental results, and sales data updated periodically according to new sales made. We often have data derived from these digital data sets. Our concern in this paper is the provenance of such derived data. Can we explain what a particular derived datum depends on, even if a value used in its derivation has since been modified. Can we determine if a particular derived value is still valid without performing full view maintenance. Questions of this sort are likely to arise when we derive results from modifiable data. We present in this paper an overview of problems that arise in this context, with regard to fine-grain data provenance, and outline solutions to some of these problems.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ProvDS: Uncertain Provenance Management over Incomplete Linked Data Streams

Data processing in distributed environments is often across heterogeneous systems, bearing the need to exchange provenance information, such as, how and when data was generated, combined, recombined, and processed. Distributed systems involve multiple participants and data sources which can produce unreliable, erroneous data. Besides, there maybe exists oceans amount of data to deal with, e.g.,...

متن کامل

Towards a Universal Data Provenance Framework Using Dynamic Instrumentation

The advantage of collecting data provenance information has driven research on how to extend or modify applications and systems in order to provide it, or the creation of architectures that are built from the ground up with provenance capabilities. In this paper we propose a universal data provenance framework, using dynamic instrumentation, which gathers data provenance information for real-wo...

متن کامل

Facilitating Trust on Data through Provenance

Research on trusted computing focuses mainly on the security and integrity of the execution environment, from hardware components to software services. However, this is only one facet of the computation, the other being the data. If our goal is to produce trusted results, a trustworthy execution environment is not enough: we also need trustworthy data. Provenance of data plays a pivotal role in...

متن کامل

Dynamic Provenance for SPARQL Update

While the Semantic Web currently can exhibit provenance information by using the W3C PROV standards, there is a “missing link” in connecting PROV to storing and querying for dynamic changes to RDF graphs using SPARQL. Solving this problem would be required for such clear use-cases as the creation of version control systems for RDF. While some provenance models and annotation techniques for stor...

متن کامل

To Trust or Not to Trust? Developing Trusted Digital Spaces through Timely Reliable and Personalized Provenance

Organizations are increasingly dependent on data stored and processed by distributed, heterogeneous services to make critical, high-value decisions. However, these service-oriented computing environments are dynamic in nature and are becoming ever more complex systems of systems. In such evolving and dynamic eco-system infrastructures, knowing how data was derived is of significant importance i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011